Picture for Björn W. Schuller

Björn W. Schuller

EIHW -- Chair of Embedded Intelligence for Health Care and Wellbeing, University of Augsburg, Germany, GLAM -- Group on Language, Audio, and Music, Imperial College London, UK

A Pilot Study on Curator-Guided Multilingual Art Description for Blind and Low-Vision Audiences with Small Vision-Language Models

Add code
May 29, 2026
Viaarxiv icon

CoarseSoundNet: Building a reliable model for ecological soundscape analysis

Add code
May 21, 2026
Viaarxiv icon

AffectSpeech: A Large-Scale Emotional Speech Dataset with Fine-Grained Textual Descriptions for Speech Emotion Captioning and Synthesis

Add code
Apr 05, 2026
Viaarxiv icon

How Class Ontology and Data Scale Affect Audio Transfer Learning

Add code
Mar 26, 2026
Viaarxiv icon

Affect Decoding in Phonated and Silent Speech Production from Surface EMG

Add code
Mar 12, 2026
Viaarxiv icon

Quantifying Dimensional Independence in Speech: An Information-Theoretic Framework for Disentangled Representation Learning

Add code
Feb 24, 2026
Viaarxiv icon

Cross-Dialect Bird Species Recognition with Dialect-Calibrated Augmentation

Add code
Sep 26, 2025
Viaarxiv icon

Affine Modulation-based Audiogram Fusion Network for Joint Noise Reduction and Hearing Loss Compensation

Add code
Sep 09, 2025
Figure 1 for Affine Modulation-based Audiogram Fusion Network for Joint Noise Reduction and Hearing Loss Compensation
Figure 2 for Affine Modulation-based Audiogram Fusion Network for Joint Noise Reduction and Hearing Loss Compensation
Figure 3 for Affine Modulation-based Audiogram Fusion Network for Joint Noise Reduction and Hearing Loss Compensation
Figure 4 for Affine Modulation-based Audiogram Fusion Network for Joint Noise Reduction and Hearing Loss Compensation
Viaarxiv icon

Speech-Based Depressive Mood Detection in the Presence of Multiple Sclerosis: A Cross-Corpus and Cross-Lingual Study

Add code
Aug 25, 2025
Figure 1 for Speech-Based Depressive Mood Detection in the Presence of Multiple Sclerosis: A Cross-Corpus and Cross-Lingual Study
Figure 2 for Speech-Based Depressive Mood Detection in the Presence of Multiple Sclerosis: A Cross-Corpus and Cross-Lingual Study
Figure 3 for Speech-Based Depressive Mood Detection in the Presence of Multiple Sclerosis: A Cross-Corpus and Cross-Lingual Study
Figure 4 for Speech-Based Depressive Mood Detection in the Presence of Multiple Sclerosis: A Cross-Corpus and Cross-Lingual Study
Viaarxiv icon

I$^2$S-TFCKD: Intra-Inter Set Knowledge Distillation with Time-Frequency Calibration for Speech Enhancement

Add code
Jun 16, 2025
Viaarxiv icon